Lara State
Regionalized models for Spanish language variations based on Twitter
Tellez, Eric S., Moctezuma, Daniela, Miranda, Sabino, Graff, Mario, Ruiz, Guillermo
Spanish is one of the most spoken languages in the globe, but not necessarily Spanish is written and spoken in the same way in different countries. Understanding local language variations can help to improve model performances on regional tasks, both understanding local structures and also improving the message's content. For instance, think about a machine learning engineer who automatizes some language classification task on a particular region or a social scientist trying to understand a regional event with echoes on social media; both can take advantage of dialect-based language models to understand what is happening with more contextual information hence more precision. This manuscript presents and describes a set of regionalized resources for the Spanish language built on four-year Twitter public messages geotagged in 26 Spanish-speaking countries. We introduce word embeddings based on FastText, language models based on BERT, and per-region sample corpora. We also provide a broad comparison among regions covering lexical and semantical similarities; as well as examples of using regional resources on message classification tasks.
- North America > United States (0.14)
- South America > Argentina (0.05)
- North America > Cuba (0.04)
- (35 more...)
- Information Technology > Services (0.93)
- Health & Medicine (0.68)
Human Activity Recognition using Attribute-Based Neural Networks and Context Information
Lüdtke, Stefan, Rueda, Fernando Moya, Ahmed, Waqas, Fink, Gernot A., Kirste, Thomas
We consider human activity recognition (HAR) from wearable sensor data in manual-work processes, like warehouse order-picking. Such structured domains can often be partitioned into distinct process steps, e.g., packaging or transporting. Each process step can have a different prior distribution over activity classes, e.g., standing or walking, and different system dynamics. Here, we show how such context information can be integrated systematically into a deep neural network-based HAR system. Specifically, we propose a hybrid architecture that combines a deep neural network-that estimates high-level movement descriptors, attributes, from the raw-sensor data-and a shallow classifier, which predicts activity classes from the estimated attributes and (optional) context information, like the currently executed process step. We empirically show that our proposed architecture increases HAR performance, compared to state-of-the-art methods. Additionally, we show that HAR performance can be further increased when information about process steps is incorporated, even when that information is only partially correct.
- Europe > Austria > Vienna (0.14)
- Europe > Germany (0.04)
- South America > Venezuela > Lara State (0.04)
- North America > United States (0.04)
- Workflow (1.00)
- Research Report > Promising Solution (0.48)
Training self-driving cars for $1 an hour
Every day for over four years, Ramses woke up in his home in Barquisimeto, Venezuela, turned on his computer, and began labeling images that will help make self-driving cars ubiquitous one day. Through a microtasking platform called Remotasks, he would identify mundane objects that line the streets everywhere -- trees, lampposts, pedestrians, stop signs -- so that autonomous vehicles could learn to notice them, too. Like many Venezuelans, Ramses turned to microtasking when his country plunged into economic turmoil. The gig gave him the opportunity to earn American dollars instead of the local currency, which is subject to extraordinarily high inflation. "I would work Sunday to Sunday," Ramses, who asked to use only his first name for privacy reasons, told Rest of World over WhatsApp.
- South America > Venezuela > Lara State > Barquisimeto (0.25)
- North America > Canada > Ontario > Toronto (0.15)
- Asia > China > Guangdong Province > Shenzhen (0.05)
- (5 more...)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.89)
- Transportation > Passenger (0.75)
- Information Technology > Robotics & Automation (0.75)